Logistic regression and machine learning predicted patient mortality from large sets of diagnosis codes comparably

نویسندگان

چکیده

Objective The objective of the study was to compare performance logistic regression and boosted trees for predicting patient mortality from large sets diagnosis codes in electronic healthcare records. Study Design Setting We analyzed national hospital records official death patients with myocardial infarction (n = 200,119), hip fracture 169,646), or colorectal cancer surgery 56,515) England 2015–2017. One-year predicted age, sex, socioeconomic status, 202 257 International Classification Diseases 10th Revision recorded preceding year not (binary predictors). Performance measures included c-statistic, scaled Brier score, several calibration. Results 17.2% (34,520) after infarction, 27.2% (46,115) fracture, 9.3% (5,273) surgery. Optimism-adjusted c-statistics models were 0.884 (95% confidence interval [CI]: 0.882, 0.886), 0.798 (0.796, 0.800), 0.811 (0.805, 0.817). equivalent tree 0.891 CI: 0.889, 0.892), 0.804 (0.802, 0.806), 0.803 (0.797, 0.809). Model also similar when measured using scores. All well calibrated overall. Conclusion In datasets records, numerous comparably.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computational techniques for spatial logistic regression with large data sets

In epidemiological research, outcomes are frequently non-normal, sample sizes may be large, and effect sizes are often small. To relate health outcomes to geographic risk factors, fast and powerful methods for fitting spatial models, particularly for non-normal data, are required. I focus on binary outcomes, with the risk surface a smooth function of space, but the development herein is relevan...

متن کامل

Mortality risk prediction in burn injury: Comparison of logistic regression with machine learning approaches.

INTRODUCTION Predicting mortality from burn injury has traditionally employed logistic regression models. Alternative machine learning methods have been introduced in some areas of clinical prediction as the necessary software and computational facilities have become accessible. Here we compare logistic regression and machine learning predictions of mortality from burn. METHODS An established...

متن کامل

Extreme Logistic Regression: A Large Scale Learning Algorithm with Application to Prostate Cancer Mortality Prediction

With the recent popularity of electronic medical records, enormous amount of medical data is being generated every day at an exponential rate. Machine learning methods have been shown in many studies to be capable of producing automatic medical diagnostic models such as automated prognostic models. However, many powerful machine learning algorithms such as support vector machine (SVM), Random F...

متن کامل

Efficient Online Learning for Large-Scale Sparse Kernel Logistic Regression

In this paper, we study the problem of large-scale Kernel Logistic Regression (KLR). A straightforward approach is to apply stochastic approximation to KLR. We refer to this approach as non-conservative online learning algorithm because it updates the kernel classifier after every received training example, leading to a dense classifier. To improve the sparsity of the KLR classifier, we propose...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Clinical Epidemiology

سال: 2021

ISSN: ['1878-5921', '0895-4356']

DOI: https://doi.org/10.1016/j.jclinepi.2020.12.018